Modelling Accents for Automatic Speech Recognition

نویسندگان

  • Maryam Najafian
  • Martin Russell
چکیده

Accent is cited as an issue for speech recognition systems. If they are to be widely deployed, Automatic Speech Recognition (ASR) systems must deliver consistently high performance across user populations. Hence the development of accentrobust ASR is of significant importance. This research investigates techniques for compensating for the effects of accents on performance of Hidden Markov Model (HMM) based ASR systems. Recently, HMM systems based on Deep Neural Networks (DNNs) have achieved superior performance to more traditional systems based on Gaussian Mixture Models (GMMs), due to the discriminative nature of DNNs. Our research confirms, this by showing that a DNN system outperforms the GMM system even after an accent-dependent acoustic model was selected using Accent Identification (AID), followed by speaker adaptation. The average performance of the DNN system over all accent groups is maximized when either accent diversity is highest, or data from “difficult” accent-groups is included in the training set.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition of South African English Accents

Declaration By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any third party rights and that I have not previously in its entirety or in part submitt...

متن کامل

Acoustic modelling of English-accented and Afrikaans-accented South African English

In this paper we investigate whether it is possible to combine speech data from two South African accents of English in order to improve speech recognition in any one accent. Our investigation is based on Afrikaans-accented English and South African English speech data. We compare three acoustic modelling approaches: separate accent-specific models, accentindependent models obtained by straight...

متن کامل

Towards a Localised German Automatic Speech Recognition

Spoken languages are often rich in regional accents and dialects. These local variations often pose challenges to automatic speech recognition. In this study, we analyse the influence of German regional accents on the performance of a large vocabulary continuous speech recogniser trained on standard German data. The experiments show a large variation in the error rate over different regions. We...

متن کامل

Accent Classification among Punjabi , Urdu , Pashto , Saraiki and Sindhi Accents of Urdu Language

Automatic Speech Recognition (ASR) is a key component in Human Computer Interaction (HCI) applications. Stability of ASR systems largely depends on accent, gender, age of speakers, background noise and channel variations. In this paper, a study has been conducted to classify five different accents of Urdu language spoken in Pakistan i.e. Punjabi, Urdu, Pashto, Saraiki and Sindhi. Speech data ha...

متن کامل

Signal-based accent and phrase marking using the fujisaki model

Automatic prosodic marking is very important in speech signal processing, since its results are required in many subsections, e.g. speech synthesis and speech recognition. The most important prosodic features on the linguistic level are the marking of accents and phrases. In this paper, we develop an automatic algorithm for marking accents and phrases which analyzes the F0 contour by using the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015